ABSTRACT

Earth-like planets are expected to provide the greatest opportunity for the detection of life 
beyond the Solar System. However, our planet cannot be considered a fair sample, especially
if intelligent life exists elsewhere. Just as a person’s country of origin is a biased sample 
among countries, so too their planet of origin may be a biased sample among planets. The 
magnitude of this effect can be substantial: over 98% of the world’s population live in a 
country larger than the median. In the context of a simple model where the mean population 
density is invariant to planet size, we infer that a given inhabited planet (such as our nearest 
neighbour) has a radius $r < 1.2\,r_\oplus$ (95% confidence bound). We show that this result is likely
to hold not only for planets hosting advanced life, but also for those which harbour primitive 
life forms. 

Further inferences may be drawn for any variable which influences population size. For 
example, since population density is widely observed to decline with increasing body mass, 
we conclude that most intelligent species are expected to exceed 300 kg. 

Key words: astrobiology - methods: statistical - planets and satellites: detection.


1 INTRODUCTION 


The discovery of extra-terrestrial life stands as one of the most ambitious
objectives in modern scientific endeavour. Over one thousand distant planets
have now been identified, spanning a broad spectrum of sizes and orbital
configurations (Rice 2014; Pepe et al. 2011; Bonfils et al. 2013; Petigura et al.
2013). Since many more await detection, only a small fraction can be subject to
detailed follow-up investigations. We must therefore identify those deemed
most likely to host life. At present the Earth is our only example of
an inhabited planet, so its physical characteristics appear to provide
a natural template for finding life elsewhere. However, as we shall
see, selection effects may have biased our observational sample.

It has often been postulated that our existence in the Universe could
explain the magnitudes of various quantities in fundamental physics, such
as the fine-structure constant, the cosmological [...] conditions exists,
we should expect to observe those which permit the emergence of life. Or
more specifically, those which maximise the abundance of life. While this
work will follow a similar line of reasoning, our approach differs from most
in that there is no requirement for an ensemble of universes to exist.

The physical characteristics of the Earth are considered to be the gold
standard for habitability (see e.g. Kasting et al. 1993; Schulze-Makuch et al.
2011). However, for any non-singular distribution of population sizes, typical
beings do not live within typical (median) populations. This is a statistical
truism, yet it ensures that a violation of the mediocrity principle is
inevitable. That we should expect to be a part of a large civilisation has been
stated previously by Gott (1993). Here we elaborate on this by considering the
bias induced on observables which influence population size, such as the size
of the host planet. We shall also generalise this result to incorporate the
broader collection of planets which harbour primitive life forms.


2 POPULATION BIAS 

In the absence of any extra information, the probability of belonging to a
particular group is proportional to the total membership of that group. This
selection effect is apparent in numerous personal characteristics: your blood
type, your class size at school, your employer, and your geographic location.
It seems unremarkable to note that you are more likely to have a common blood
type than a rare one, or that you are more likely to be living in China than
the Cayman Islands. But if mankind’s colonisation had spread beyond multiple
continents to include another planet, the likelihood of belonging to this
second planet must be weighted in the same manner. Of particular importance to
this work is the following supposition: if the second colony had not arrived
from Earth but instead evolved independently, it appears difficult to justify
why our calculation should no longer hold.
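The weighting by group membership can be illustrated with a short simulation. This is only a sketch: the lognormal scatter in group sizes and its width are assumptions chosen for illustration, not census data, and the names used are hypothetical.

```python
import random
import statistics

rng = random.Random(0)

# Hypothetical group sizes (e.g. populations); the lognormal scatter is an assumption.
group_sizes = [rng.lognormvariate(0.0, 1.5) for _ in range(10_000)]
median_size = statistics.median(group_sizes)

# Sampling per individual: each group is chosen with probability proportional to its size.
sizes_seen = rng.choices(group_sizes, weights=group_sizes, k=10_000)

# Fraction of individuals who find themselves in a group larger than the median group.
fraction_above_median = sum(s > median_size for s in sizes_seen) / len(sizes_seen)
print(f"individuals in groups larger than the median group: {fraction_above_median:.1%}")
```

By construction half of the groups lie above the median size, yet a large majority of individuals belong to those groups, and the excess grows with the breadth of the size distribution.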

Figure 1 (horizontal axis: Core Radius, $r/r_\oplus$). Constraints on the radius $r$
of an inhabited planet, based on a constant mean population density, and after
marginalising over the $\mu$ and $\sigma$ parameters of the lognormal distribution.
The thick and thin solid lines correspond to the higher and lower set of $\sigma$
values. The dashed line illustrates the effect of imposing the condition
$r > 0.5\,r_\oplus$, as may be the requirement for an atmospheric water cycle. A
common feature is the manner in which super-Earths are disfavoured, regardless of
the choice of prior. The thick and thin lines yield 95% confidence bounds of
$r < 0.9\,r_\oplus$ and $r < 1.2\,r_\oplus$ respectively, while the dashed line sets
$r < 1.4\,r_\oplus$. For reference, the radii of Europa and Mars are $0.25\,r_\oplus$
and $0.53\,r_\oplus$ respectively.

Throughout this work we shall assume that the Universe hosts an ensemble of
planets with advanced civilisations. We define an advanced civilisation as a
population of observers which has (a) colonised most of its host planet/moon and
(b) developed sufficient intelligence to contemplate the existence of other
inhabited planets. It is helpful to express the total number of observers $N$
produced by a given civilisation in the form

$$N = \frac{x L}{R} . \qquad (1)$$

Here $L$ is the longevity of the civilisation, over which time it sustains a
mean population size $x$. The mean lifespan averaged over all individuals is
denoted by $R$. No assumption is made regarding the temporal variation of
either the population size or birth rate. If the ensemble of civilisations is
suitably large we may consider a continuous probability distribution for the
set of civilisation characteristics $C = \{x, L, R, \ldots\}$. This encompasses a
broad range of variables, including the physical characteristics of the host
planet, such as its radius, density, and atmospheric properties.

We wish to determine how a civilisation selected at random, such as our
nearest neighbour, compares to our own. Our nearest neighbour is drawn from the
ensemble of civilisations which exist today, $p(C|T)$, at a fixed time $T$. The
Earth is drawn from a different distribution, $p(C|I)$, the ensemble of
civilisations as sampled by a random observer, $I$. Using Bayes’ theorem,
$p(C|T) \propto L\,p(C)$, since $p(T|C) \propto L$, demonstrating that we are more
likely to co-exist with long-lived civilisations. Similarly, we find
$p(C|I) \propto N\,p(C)$. Substituting (1), these two distributions are then
related by $p(C|I) \propto \frac{x}{R}\,p(C|T)$. In order to determine the
probability distribution for a single element $\theta \in C$, we marginalise
over all other parameters, yielding


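$$p(\theta|I) \propto p(\theta|T)\, E\!\left(\frac{x}{R} \,\Big|\, \theta, T\right) , \qquad (2)$$

where the conditional expectation is defined as follows

$$E\!\left(\frac{x}{R} \,\Big|\, \theta, T\right) = \iint \frac{x}{R}\, p(x, R|\theta, T)\, \mathrm{d}x\, \mathrm{d}R . \qquad (3)$$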


This is a general result, which makes no assumptions regarding the
functional form of $p(x)$. If the expectation $E(x/R|\theta,T)$ is sensitive to
the value of $\theta$, then $p(\theta|I)$ will differ from $p(\theta|T)$. In other
words, provided the mean population of advanced civilisations is correlated
with any planetary characteristic, then the Earth is a biased sample among
inhabited planets. This is the central result of this work. We now consider
the particular case where the generic parameter $\theta$ is taken to be the
planet radius $r$.
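The reweighting between the two ensembles can be sketched for a discrete parameter. The three bins, their probabilities and the expected values of $x/R$ below are hypothetical numbers chosen purely for illustration; the function name is likewise an assumption.

```python
def observer_weighted(p_theta_T, expected_rate):
    """Return p(theta|I), proportional to p(theta|T) * E(x/R | theta, T), normalised to one."""
    unnormalised = [p * e for p, e in zip(p_theta_T, expected_rate)]
    total = sum(unnormalised)
    return [u / total for u in unnormalised]

# Hypothetical three-bin example: fraction of civilisations per bin at a fixed time,
# and the expected observer production rate x/R within each bin.
p_T = [0.5, 0.3, 0.2]
rate = [1.0, 2.0, 5.0]

p_I = observer_weighted(p_T, rate)
print([round(p, 3) for p in p_I])
```

The bin that hosts the fewest civilisations ends up hosting the most observers, because its expected rate of observer production is the highest.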


3 PLANETARY SELECTION EFFECTS 

The distribution of planetary radii has now been measured down to terrestrial
sizes. Silburt et al. (2015) found that across the range $1 < r/r_\oplus < 4$, the
distribution $p(r)$ appears to be approximately constant in log space. This was
inferred from the radii of planets discovered by the Kepler spacecraft, taking
into account the various observational selection effects. However, the
distribution of radii among planets which host life remains highly uncertain.

We seek to estimate the radius of another inhabited planet, such as our
nearest neighbour, which is drawn from $p(r|T)$. The Earth’s radius is our data
point, representing a single sample drawn from $p(r|I)$. Ordinarily a solitary
sample is insufficient to draw any conclusions about the range of the parent
distribution, since an estimate of the variance requires at least two points.
However, when biased, a single sample can be sufficient to impose a useful
bound. For example, rolling a die once does not reveal any information on the
values of the other faces. But if the die is loaded, such that each face’s
probability is proportional to its value, then we can infer from a single throw
that the other faces have values lower or very similar to the observed face.
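The loaded die can be worked through exactly. The sketch below assumes a conventional set of face values, 1 to 6, which is an illustrative choice rather than part of the argument.

```python
from fractions import Fraction

def loaded_die_probs(faces):
    """Probability of each face when it is proportional to the face value."""
    total = sum(faces)
    return {face: Fraction(face, total) for face in faces}

# Illustrative assumption: a die with the usual face values 1..6.
probs = loaded_die_probs([1, 2, 3, 4, 5, 6])

# Chance that a single throw lands on one of the three highest faces.
p_top_half = probs[4] + probs[5] + probs[6]
print(p_top_half)
```

A single throw of such a die lands in the upper half of the face values most of the time, so the observed face is informative about the largest values present.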

It would be unreasonable to expect the mean population of inhabited planets
to remain constant for different radii. The available surface area increases,
as does the total amount of available energy. For small perturbations in
planetary radius we should expect the mean population density to remain
approximately constant. This suggests the expected population size rises as
$E(x|r,T) \propto r^2$. This scaling relation implies that the mean observed
radius $\mu_I$ is related to the mean radius $\mu$ by

$$\mu_I = \mu \left( 1 + \frac{2\sigma^2 + \mu_3/\mu}{\mu^2 + \sigma^2} \right) , \qquad (4)$$

where $\sigma^2$ and $\mu_3$ are the variance and third central moment of the
distribution of inhabited radii. For any non-singular distribution
($\sigma^2 > 0$), observers should expect to find themselves on a larger planet
than if the planet had been selected at random from the ensemble of inhabited
planets. The broader the distribution of radii, the stronger the observational
bias.
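The relation between the two means can be checked numerically. Assuming the weighting $p(r|I) \propto r^2 p(r|T)$, the observer-weighted mean is $E(r^3)/E(r^2)$, which can be rewritten in terms of the mean, the variance and the third central moment. The discrete distribution of radii below is hypothetical, and the function names are assumptions made for this sketch.

```python
def central_moments(radii, probs):
    """Mean, variance and third central moment of a discrete distribution."""
    mu = sum(p * r for r, p in zip(radii, probs))
    var = sum(p * (r - mu) ** 2 for r, p in zip(radii, probs))
    mu3 = sum(p * (r - mu) ** 3 for r, p in zip(radii, probs))
    return mu, var, mu3

def observed_mean_direct(radii, probs):
    """Mean radius seen by observers when each planet is weighted by r^2."""
    return sum(p * r ** 3 for r, p in zip(radii, probs)) / sum(
        p * r ** 2 for r, p in zip(radii, probs)
    )

def observed_mean_from_moments(mu, var, mu3):
    """The same quantity expressed through mu, sigma^2 and the third central moment."""
    return mu * (1.0 + (2.0 * var + mu3 / mu) / (mu ** 2 + var))

# Hypothetical distribution of inhabited radii, in Earth radii.
radii = [0.5, 1.0, 1.5, 2.0]
probs = [0.4, 0.3, 0.2, 0.1]

mu, var, mu3 = central_moments(radii, probs)
direct = observed_mean_direct(radii, probs)
from_moments = observed_mean_from_moments(mu, var, mu3)
print(mu, direct, from_moments)
```

The two expressions agree to rounding error, and both exceed the unweighted mean for this example.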

To progress quantitatively, we marginalise over a family of possible
distributions for $p(r|T)$. The functional form is not particularly important,
although it should ensure $p(r|T)$ falls to zero for very small and very large
values of $r$. Here we adopt a Gaussian distribution in log space,
$\ln\mathcal{N}(\mu_r, \sigma_r)$, marginalising over its mean and variance. The
full posterior likelihood $p(r|D,T)$ of inhabited planetary radii given our data
$D$ (the radius of the Earth) is then

$$p(r|D,T) \propto \iint p(D|\mu_r,\sigma_r)\, p(r|\mu_r,\sigma_r)\, \pi(\mu_r,\sigma_r^2)\, \mathrm{d}\mu_r\, \mathrm{d}\sigma_r^2 , \qquad (5)$$

where we adopt a reference prior $\pi(\mu_r,\sigma_r^2) \propto \sigma_r^{-2}$.
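A marginalisation of this kind can be sketched numerically. The following is not the authors' calculation and is not expected to reproduce the quoted bounds: it assumes a weighting exponent $k = 2$, a flat prior on the log-space mean (integrated analytically), a prior uniform in $\ln\sigma_r$ over a finite range, and the Earth as a single datum at $\ln(r/r_\oplus) = 0$. All function names are hypothetical.

```python
import math

def log_uniform_sigmas(sigma_lo, sigma_hi, n=400):
    """Midpoints of a grid uniform in ln(sigma), i.e. weight d(sigma^2) / sigma^2."""
    lo, hi = math.log(sigma_lo), math.log(sigma_hi)
    step = (hi - lo) / n
    return [math.exp(lo + (i + 0.5) * step) for i in range(n)]

def predictive_cdf(u, sigmas, k=2.0):
    """P(ln(r / r_earth) < u) for another inhabited planet, given the Earth at u = 0.

    For fixed sigma and a flat prior on mu, the observer-weighted datum implies
    mu ~ N(-k sigma^2, sigma^2), so the predictive law of u is N(-k sigma^2, 2 sigma^2).
    """
    total = 0.0
    for s in sigmas:
        total += 0.5 * (1.0 + math.erf((u + k * s * s) / (2.0 * s)))
    return total / len(sigmas)

def predictive_quantile(q, sigmas, k=2.0):
    """Invert the predictive CDF by bisection; returns r / r_earth."""
    lo, hi = -50.0, 50.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if predictive_cdf(mid, sigmas, k) < q:
            lo = mid
        else:
            hi = mid
    return math.exp(0.5 * (lo + hi))

narrow = log_uniform_sigmas(0.05, 0.2)   # assumed range for the narrow prior
broad = log_uniform_sigmas(0.2, 0.8)     # assumed range for the broad prior
median_narrow = predictive_quantile(0.5, narrow)
median_broad = predictive_quantile(0.5, broad)
print(f"median r/r_earth: narrow prior {median_narrow:.2f}, broad prior {median_broad:.2f}")
```

Under these assumptions the predictive median lies below the Earth's radius for both ranges of $\sigma_r$, and lies further below it for the broader range.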

Figure 1 shows the likelihoods resulting from two different priors on
$\sigma_r$. The thin solid line corresponds to a range of narrow distributions
$0.05 < \sigma_r < 0.2$. For these low values of $\sigma_r$, the total distribution
is narrow so radii far from the Earth are always disfavoured, and we find
$r < 1.2\,r_\oplus$ (95% confidence bound). The thick solid line spans
$0.2 < \sigma_r < 0.8$, setting a tighter bound $r < 0.9\,r_\oplus$. At these larger
values of $\sigma_r$, there is a strong selection effect at work, as quantified by
(4). This favours small planets, leaving the Earth to appear highly atypical.
More extreme values of $\sigma_r$, either larger or smaller than those considered
here, only serve to tighten these bounds on $r$ further. The extent to which the
Earth overestimates the radii of other inhabited planets strongly depends on
the choice of prior. However, the conclusion that larger radii are disfavoured
is robust to the choice of prior on $\sigma_r$.

Thus far we have deliberately neglected criteria which are conventionally
assumed to influence habitability. If the emergence of life needs water, we can
set a more stringent limit on the range of habitable radii. The smallest
planets are not expected to be able to retain a thick enough atmosphere to
sustain a water cycle. Truncating the posterior likelihood generated from the
broad prior such that $p(r < 0.5\,r_\oplus) = 0$ leads to a modest amplification
of the likelihood at larger radii, as shown by the dashed line. In this case we
find the 95% confidence bound to be $r < 1.4\,r_\oplus$.

These results involve marginalising over a range of possible distributions.
Figure 2 shows the specific case where $\sigma_r = 0.4$ and $\mu_r = 1$. The
contours reflect the 68% and 95% confidence limits for the planet-sampled
(solid) and observer-sampled (dashed) distributions. At the smallest radii the
distributions are truncated due to atmospheric mass loss. The planet inhabited
by the typical observer is over five times larger than the median planet size.
The magnitude of this bias is predominantly determined by the breadth of the
distribution, $\sigma_r$.

Note that we have only estimated the relative abundance of inhabited
planets, and make no statement regarding their overall prevalence in the
Universe. These results are insensitive to the distribution of population
sizes (it need not be lognormal), to the variables which appear in the Drake
equation (Drake & Sobel 1992), and to the numerous variables which influence
population size, provided they remain uncorrelated with planet size.


4 THE DISTRIBUTION OF PRIMITIVE LIFE 

Thus far our calculation applies to planets harbouring advanced life forms.
How does this relate to the question of where primitive life resides?
Primitive life-forms are a pre-requisite for advanced life, and so their host
planets must trace at least the same volume of parameter space as those of
intelligent species. The probability density function for a planet hosting
primitive life, $p(r|P)$, may be expressed as

$$p(r|P) \propto \frac{1}{\eta(r)}\, p(r|T) , \qquad (6)$$

where $\eta(r)$ is the proportion of life-bearing planets which host advanced
life, as a function of radius $r$. If this function is constant,
$\eta(r) = \eta_0$, then the two distributions are identical. However, larger
biospheres host a wider range of species and environments, and a greater number
of individual life-forms. For these reasons it appears likely that larger
planets possess a greater probability of producing advanced life from a
primitive state. If we conservatively adopt $\eta(r) \propto r$, the upper bound
derived from the broad prior in Figure 1 shrinks to yield $r < 0.2\,r_\oplus$.
The truncated likelihood, as illustrated by the dashed line in Figure 1,
tightens slightly to $r < 1.2\,r_\oplus$ (95% confidence bound).

Figure 2. An illustration of the biases in both population and planet size
which can be induced when sampling on a per-individual basis rather than a
per-population basis. The solid contours represent where 68% and 95% of
civilisations would be found in this model. The dashed contours represent the
distribution as sampled by an individual. In each case the median values are
highlighted by dotted lines. In this example we calibrate the per-individual
median to that of the Earth’s volume and estimated population of seven billion,
yielding a typical civilisation of 15 million inhabiting a host planet
approximately half the size of the Earth.

Even if mankind represents the only form of advanced life in the Universe,
the fact that $\eta(r)$ is likely to increase for larger values of $r$ suggests
that the Earth is probably larger than most life-bearing planets.


5 CHARACTERISTICS OF ADVANCED SPECIES 

There are likely to be a number of other variables, aside from the size of
the host planet, which are subject to selection bias due to their influence on
population size. For example, species with a lower body mass are able to
sustain a higher population density. This is a trend which has been extensively
observed throughout the animal kingdom (Damuth 1981, 1987; Loeuille & Loreau
2006). One proposed mechanism originates from Kleiber’s law, the scaling
relation linking the basal metabolic rate (BMR) to the body mass $m_s$,
BMR $\propto m_s^{3/4}$ (Kleiber 1932; Agutter & Wheatley 2004). Given the finite
energy resources available, population density drops as the individual’s energy
demand rises. Ants and termites vastly outnumber humans due to their small
size. If mankind tried to match their population, our total metabolic demand
would exceed the entire solar flux incident upon our planet.

We adopt a scaling relation between mean population and body mass given by
$E(x|m_s,T) \propto m_s^{-3/4}$. Some variation in the value of this exponent has
been observed (Loeuille & Loreau 2006); however, these were generally found to
be steeper relationships, thus our model is a conservative one. It is also the
case that larger animals live longer (Speakman 2005; Hulbert et al. 2007),
suggesting $R \propto m_s^{1/4}$. Therefore we estimate that the conditional
expectation defined in (3) is inversely proportional to body mass,
$E(x/R|m_s,T) \propto m_s^{-1}$.

Figure 3 (horizontal axis: Body Mass [kg]). One-dimensional constraint on the
body mass of an intelligent species after marginalising over the two lognormal
parameters. The thin dashed line represents our data point, at 70 kg, while the
thick dashed line represents the median mass, at 314 kg. This offset is
predominantly due to the expected fall in population size with increasing body
mass. For reference the adult African Elephant is approximately 6,000 kg, while
the heaviest dinosaurs such as Argentinosaurus were thought to be approximately
$10^5$ kg (Sellers et al. 2013).

The distribution of body masses among species on Earth can be well described
by a lognormal (Greenwood et al. 1996). Again we marginalise over the two
lognormal parameters with a reference prior
$\pi(\mu_m,\sigma_m^2) \propto \sigma_m^{-2}$. It would be extremely surprising if
the diversity in body mass among extra-terrestrial species is lower than that
amongst a small group of closely related species on Earth. There are seven
species of great ape, spanning gorillas, orangutans, humans, and chimpanzees.
Their body masses exhibit an inter-species standard deviation in log space of
$\sigma_m = 0.5$, which serves as our lower bound. For an upper bound we adopt
$\sigma_m = 3$, so as not to greatly exceed the terrestrial variance.

Figure 3 illustrates the probability density function for body mass, as
derived from our single data point of 70 kg. The median body mass is found to
be 314 kg, while the 95% lower bound is given by $m_s > 25$ kg. It is likely
some correlation exists between planet size and median body mass, due to the
influence of surface gravity, but we do not attempt to model this here.
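The size of such an offset can be gauged with a fixed-width lognormal. If $\ln m$ is Gaussian with standard deviation $\sigma_m$ and each species is weighted by $m^k$, the weighted distribution is again lognormal with its log-mean shifted by $k\sigma_m^2$. The sketch below takes $k = -3/4 - 1/4 = -1$ and treats 70 kg as the observer-weighted median; both the fixed $\sigma_m$ and that identification are simplifying assumptions, so this is not the marginalised calculation described in the text. The function names are hypothetical.

```python
import math

def species_sampled_median(observer_median, sigma, k):
    """Median of the unweighted lognormal, given the median after weighting by m**k."""
    return observer_median * math.exp(-k * sigma ** 2)

def sigma_for_ratio(ratio, k):
    """Log-space width for which unweighted / weighted medians equals `ratio`."""
    return math.sqrt(math.log(ratio) / -k)

k = -0.75 - 0.25  # population density exponent combined with lifespan exponent

# Lower bound on the width adopted in the text, sigma_m = 0.5.
median_at_lower_bound = species_sampled_median(70.0, 0.5, k)

# Width that would give the quoted medians of 314 kg and 70 kg in this simplified picture.
implied_sigma = sigma_for_ratio(314.0 / 70.0, k)
print(f"{median_at_lower_bound:.1f} kg, sigma = {implied_sigma:.2f}")
```

In this simplified picture the offset grows rapidly with the assumed diversity in body mass, since it scales as the exponential of $\sigma_m^2$.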

In order to further explore the range of variables which influence the mean
population size $x$, and are therefore susceptible to selection bias, we can
decompose the contributing factors as follows

$$x = \eta_b \frac{A E_s}{E_i} , \qquad (7)$$

where the physical factors are the available area $A$, and the energy flux
density at the surface $E_s$. The biological terms are the energy demand of the
individual $E_i$, and the energy efficiency of the species $\eta_b$. If the
highest populations are resource limited, this may suggest we are receiving an
unusually high radiation flux $E_s$. This could arise either from being
relatively close to our host star, or by possessing a lower atmospheric
opacity. In red dwarf systems most of the incident radiation is in the infrared
which could lead to considerably lower values for $E_s$.
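Assuming the decomposition takes the form $x = \eta_b A E_s / E_i$, the dependence on each factor can be made explicit in a few lines. The numerical values are placeholders with no physical significance, and the function names are hypothetical.

```python
import math

def mean_population(efficiency, area, surface_flux, individual_demand):
    """Mean population as efficiency * area * surface flux / demand per individual."""
    return efficiency * area * surface_flux / individual_demand

def sphere_area(radius):
    """Surface area of a sphere, used here as the available area."""
    return 4.0 * math.pi * radius ** 2

# Placeholder values: efficiency (dimensionless), flux and demand in arbitrary units.
base = mean_population(1e-3, sphere_area(1.0), 100.0, 100.0)
doubled_radius = mean_population(1e-3, sphere_area(2.0), 100.0, 100.0)
halved_flux = mean_population(1e-3, sphere_area(1.0), 50.0, 100.0)

print(doubled_radius / base, halved_flux / base)
```

With the other factors held fixed, the population scales with the square of the radius and linearly with the surface flux, which is why either quantity is susceptible to the same selection effect.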


6 CONCLUSIONS 

On purely statistical grounds, any given individual should expect to be part
of a larger group, not an ordinary group. Therefore, unless we are alone in the
Universe, our planet is likely to be one which produces observers at a higher
rate than most other inhabited planets. This may be accomplished by having a
relatively large population, in combination with a low individual life
expectancy. Any variable which correlates with population size will also be
subject to observational bias. By adopting a simple model where the mean
population density is constant, we find that an inhabited planet selected at
random has a radius $r < 1.2\,r_\oplus$ (95% confidence bound).

Our conclusions are not restricted to the search for intelligent life.
Provided the emergence of advanced life from primitive forms is not more likely
on smaller planets, then the upper bound $r < 1.2\,r_\oplus$ also applies to a
planet hosting life of any kind.

Two distinct methods are currently being pursued for finding life on
exoplanets: biomarkers within the atmospheric spectra of exoplanets (Hedelt et
al. 2013), and somewhat more speculatively, the reception of radio signals from
advanced civilisations (Tarter et al. 2011). In each case the signal is
extremely challenging to detect, and it is therefore vital to correctly
prioritise the strongest candidates. Larger planets and larger populations
might provide stronger signals. However, since they are expected to be
relatively scarce, this gain may be offset by their greater distance from the
Earth.

Aside from the size of the host planet, there are a number of other
variables which influence population size, and these are also subject to
observational bias. Throughout the animal kingdom, species which are physically
larger invariably possess a lower population density, possibly due to their
enhanced energy demands. As a result, we should expect humans to be physically
smaller than most other advanced species. By marginalising over a feasible
range of standard deviations, we conclude that most species are expected to
exceed 300 kg in body mass. The median body mass is similar to that of a polar
bear.

While larger species possess larger brains, the correlation between brain
size and intelligence is weak. Higher intelligence enables the development of
technologies which can sustain larger population sizes. However, it could also
enable the longevity of individuals to increase substantially, thereby pushing
the selection bias in the opposite direction. The net effect remains unclear.

The degree to which mankind and the Earth are atypical hinges on the level of
diversity among advanced life forms. As we have repeatedly learned from the
discoveries of distant planets, and the exploration of life on our own, nature
is invariably more diverse than we anticipate, not less.